Sponsored by Deepsite.site

Vision Mcp Server | 图片分析 Mcp

Created By
Markusbetter2 months ago
This MCP addresses the visual recognition limitations of text-based models by enabling accurate image description and identification, making it excellent for AI-assisted reference design interface analysis. It currently supports dropping links into the dialog box or placing images in the project folder for recognition. The tool can be integrated with MCP platforms like Claude Code, Cline, and Trae. Beyond programming applications, it also provides visual recognition capabilities for models that lack native image processing functionality. For visual models, users can select their preferred model from ModelScope community and replace it during MCP configuration setup. 📱 Daily Use Cases: Send screenshots to directly identify errors or issues Share image links or place screenshots in the project folder for AI-assisted layout optimization Submit product image links to generate promotional copy 该mcp可以解决文字模型图片识别的视觉的问题,可以准确识别描述图片,用来给AI看参考设计界面很nice~ 目前支持丢链接到对话框,以及把图片放到项目文件夹进行识别。 支持加入到Claude Code,Cline和Trae等mcp工具中。 除了编程外,如果你使用的模型本身不支持视觉图片识别,也可以使用~ 视觉模型可以自己去魔搭社区选一个自己喜欢的,在填写mcp配置的时候替换即可 📱 日常使用场景 - 截图发过去,直接告诉哪里出错了 - 丢过去一个图片链接或者截图放到项目文件夹内,让AI帮忙优化布局 - 发个产品图链接,让AI写推广文案
Content

Vision MCP Server | 图片分析 MCP

English | 中文


中文

一个用于图片分析的 MCP (Model Context Protocol) 服务器,支持图片内容分析和描述。 例如当你在客户端的模型只支持文字输入,这时你可以使用视觉模型mcp来弥补。 这个项目采用了魔搭社区免费的视觉模型Qwen3-VL-30B-A3B-Instruct(你也可以在配置中,使用魔搭社区自行更换为自己想要的视觉模型)。

功能特点

  • 支持本地图片文件和在线图片 URL
  • 基于魔搭社区 AI 模型的智能图像分析
  • 完全兼容 MCP 协议
  • TypeScript 支持,提供完整的类型定义

安装

方式一:使用 npx(推荐)

无需预先安装,在客户端填写以下内容npx 会自动下载并运行最新版本:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

方式二:全局安装

npm install -g vision-mcp-server

然后在客户端配置中:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "vision-mcp-server",
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

方式三:本地安装

npm install vision-mcp-server

然后在客户端配置中:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "node",
      "args": ["node_modules/vision-mcp-server/dist/index.js"],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

环境变量配置

在使用前,需要设置以下环境变量:

  • MODELSCOPE_TOKEN: 魔搭社区的 API 密钥(必需)
    • 获取方式:访问 魔搭社区 → 个人中心 → API令牌
  • MODELSCOPE_MODEL: 使用的模型名称(可选,默认为 "Qwen/Qwen3-VL-30B-A3B-Instruct")
    • 支持其他视觉模型,如:Qwen/Qwen2-VL-7B-Instruct

使用示例

// 分析本地图片
{
  "name": "analyze_image",
  "arguments": {
    "image": "/path/to/your/image.jpg",
    "prompt": "请描述这张图片的内容"
  }
}

// 分析在线图片
{
  "name": "analyze_image",
  "arguments": {
    "image": "https://example.com/image.jpg",
    "prompt": "这张图片中有哪些物体?"
  }
}

API 参考

analyze_image

分析图片内容并提供详细描述。

参数:

  • image (string): 图片 URL 或本地文件路径
  • prompt (string, 可选): 对图片的问题或分析要求,默认为 "请描述这张图片的内容"

返回: 图片内容的详细文本描述。

开发

构建

npm run build

测试

npm test

贡献

欢迎提交 Issue 和 Pull Request!

许可证

MIT

更新日志

1.0.0

  • 初始版本发布
  • 支持图片分析功能
  • 兼容 MCP 协议

English

A Vision Analysis MCP (Model Context Protocol) Server that supports image content analysis and description.

Features

  • Support for local image files and online image URLs
  • Intelligent image analysis based on ModelScope AI models
  • Full compatibility with MCP protocol
  • TypeScript support with complete type definitions

Installation

No need to pre-install, npx will automatically download and run the latest version:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Option 2: Global Installation

npm install -g vision-mcp-server

Then in your client configuration:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "vision-mcp-server",
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Option 3: Local Installation

npm install vision-mcp-server

Then in your client configuration:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "node",
      "args": ["node_modules/vision-mcp-server/dist/index.js"],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Environment Variables Configuration

Before using, you need to set the following environment variables:

  • MODELSCOPE_TOKEN: ModelScope API key (required)
    • Get it from: ModelScope → Profile → API Token
  • MODELSCOPE_MODEL: Model name to use (optional, default is "Qwen/Qwen3-VL-30B-A3B-Instruct")
    • Supports other vision models, such as: Qwen/Qwen2-VL-7B-Instruct

Usage Examples

// Analyze local image
{
  "name": "analyze_image",
  "arguments": {
    "image": "/path/to/your/image.jpg",
    "prompt": "Please describe the content of this image"
  }
}

// Analyze online image
{
  "name": "analyze_image",
  "arguments": {
    "image": "https://example.com/image.jpg",
    "prompt": "What objects are in this image?"
  }
}

API Reference

analyze_image

Analyze image content and provide detailed description.

Parameters:

  • image (string): Image URL or local file path
  • prompt (string, optional): Question or analysis requirement for the image, default is "Please describe the content of this image"

Returns: Detailed text description of the image content.

Development

Build

npm run build

Test

npm test

Contributing

Issues and Pull Requests are welcome!

License

MIT

Changelog

1.0.0

  • Initial release
  • Image analysis support
  • MCP protocol compatibility

Server Config

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
DeepChatYour AI Partner on Desktop
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
WindsurfThe new purpose-built IDE to harness magic
Amap Maps高德地图官方 MCP Server
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Tavily Mcp
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
CursorThe AI Code Editor
Serper MCP ServerA Serper MCP Server
Playwright McpPlaywright MCP server
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
ChatWiseThe second fastest AI chatbot™
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。